Back

Journal of Medical Genetics

BMJ

Preprints posted in the last 30 days, ranked by how well they match Journal of Medical Genetics's content profile, based on 28 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.

1
Whole-exome-based preconception carrier screening in Uzbekistan with targeted SMA, FMR1, and DMD assays: the first reported clinical program

Kullyev, A.; Avdeichik, S.; Akimenkova, A.; Kartuesov, A.; Kardymon, O.; Goikhman, Y.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.02.26354713 medRxiv
Top 0.1%
10.3%
Show abstract

Abstract Purpose: Published clinical outcome data on preconception carrier screening (PCS) in Central Asia are limited. We report the first clinical implementation study from Uzbekistan of a whole-exome sequencing (WES)-based multi-platform PCS program combining exome sequencing with targeted SMA, FMR1, and DMD assays. Methods: We retrospectively analyzed anonymized data from 65 individuals (19 couples, 27 singletons) screened at IMC Genomics, Tashkent, between January 2024 and May 2026. WES covering the protein-coding regions of approximately 20,000 genes was followed by exome-wide bioinformatics filtering and clinical geneticist interpretation. Partly overlapping cohorts underwent SMA carrier screening (n=179), FMR1 CGG-repeat analysis in females (n=155), and DMD deletion/duplication testing in preconception females (n=29). Variants were classified by ACMG/AMP criteria against gnomAD v4.1. Results: Sixty-one of 65 WES-screened individuals (93.8%; 95% CI 85.2 - 97.6%) carried at least one reportable variant (152 instances across 126 genes). Four of 19 couples (21.1%; 95% CI 8.5 - 43.3%) were concordant for pathogenic or likely pathogenic variants in the same autosomal recessive gene; two were referred for preimplantation genetic testing for monogenic disease. SMA screening identified four carriers, including two 2+0 silent carriers; FMR1 analysis identified one intermediate allele; DMD MLPA identified no exonic rearrangements. Conclusion: This first reported WES-based multi-platform PCS program in Uzbekistan was feasible and clinically informative, identifying actionable couple-level reproductive risks and supporting structured implementation of reproductive genetic screening in Central Asia.

2
Integrating enriched case data from national laboratory testing with population-based case-control analyses: a novel statistical likelihood-ratio methodology for PS4 applied to 325,345 breast cancer cases and 671,006 controls

Allen, S.; Rowlands, C. F.; Garrett, A.; Couch, F.; Richardson, M. E.; Pesaran, T.; Pethick, J.; Lavelle, K.; McRonald, F.; Vernon, S.; Torr, B.; Loong, L.; Aungraheeta, R.; Durkie, M.; Burghel, G. J.; Callaway, A.; Robinson, R.; Field, J.; Frugtniet, B.; Palmer-Smith, S.; Grant, J.; Pagan, J.; McDevitt, T.; Snape, K.; Hanson, H.; McVeigh, T.; Loveday, C.; Jones, M.; Hardy, S.; Turnbull, C.; CanVIG-UK,

2026-05-17 genetic and genomic medicine 10.64898/2026.05.13.26353095 medRxiv
Top 0.1%
9.4%
Show abstract

Background: For many evidence criteria within v3.0 of the ACMG/AMP guidelines, methodologies have been developed to empower their use outside the stipulated evidence strengths. However, no such methodology has been established for case-control data (PS4). With the release of large-scale unselected case-control datasets and expansion of nationally-collected laboratory datasets enriched for pathogenic variant carriers, there is potential to combine datasets across ascertainment contexts in a more quantitative manner using novel likelihood ratio tools. Methods: Using our published PS4-LR-Calculator, we calculated a combined log likelihood ratio (PS4-LLR) across five datasets (three unselected, and two enriched), and estimated enrichment of pathogenic variants in clinically-ascertained laboratory data using truncating variant prevalence. Results: Data were combined for 10,817 missense variants from 325,345 female breast cancer patients and 671,006 controls of Western European ancestry for five breast cancer susceptibility genes (BRCA1, BRCA2, PALB2, ATM, CHEK2). A combined LLR was produced for 4,690 missense variants; 927 variants received evidence towards pathogenicity (LLR[≥]1), and 3,242 received evidence towards benignity (LLR[≤]-1). Conclusion: This flexible, variant-level methodology combines nationally-collected 'enriched' datasets with unselected case-control cohorts, expanding the available information for case-control analysis, boosting power, enabling exploration of atypical penetrance and empowering variant classification.

3
Appraising familial prediction of proband outcomes in neurogenetic disorders

Reimer, S.; Wilson, K.; Schaffer, L.; Larsen, I.; Roybal, M.; Rau, S.; Seebeck, J.; Torres, E.; Clasen, L.; Liu, S.; Raznahan, A.

2026-05-22 psychiatry and clinical psychology 10.64898/2026.05.20.26353681 medRxiv
Top 0.1%
8.4%
Show abstract

Abstract Background Gene dosage disorders impact cognition and psychopathology, but outcomes vary widely amongst carriers of the same variant. Recent work has sought to better predict proband outcomes using measures of corresponding traits in family members. However, family-based models have not yet been prospectively quantified across several traits in different genetic disorders, nor evaluated for the precision they afford: both crucial issues for clinical implementation. Methods In a first test case for these questions, we apply regression analyses to quantify and compare family-based prediction of 12 traits (including IQ, autism- and ADHD-related traits) in 433 individuals from families including a proband with XXY or XYY syndrome (N=93 and 58, respectively). Results The 12 traits vary substantially in their proband-family associations (0.001<|r|<0.55) - with differences emerging between XXY and XYY syndrome. Only two traits also show significant and similar proband-family associations in both aneuploidies, with the greatest concordance found for IQ. A family-based model for IQ prediction in male sex chromosome trisomies significantly reduces error vs. a group mean IQ model (F = 7.4, p = 0.006), but only in 65% of probands, and with mean error reduction of ~2 IQ points. Conclusions Family-based prediction of neuropsychiatric traits in genetic syndromes likely requires trait- and syndrome- specific models. Family models can significantly improve outcome prediction for IQ, but to variable degrees across individuals and with a small mean improvement. By mapping and quantifying these limits, our work helps draft a roadmap for refinement of family-based prediction of proband outcomes in gene dosage disorders.

4
Family functioning and psychiatric outcomes in children and young people with intellectual and developmental disabilities caused by rare genetic mutations

Haddon, J. E.; Hall, J. H.; IMAGINE ID, ; Hall, J.; Owen, M. J.; van den Bree, M. B. M.

2026-05-13 psychiatry and clinical psychology 10.64898/2026.05.10.26352561 medRxiv
Top 0.1%
8.0%
Show abstract

BackgroundA range of rare chromosomal micro-deletions or -duplications (Copy Number Variants - CNVs) are associated with high risk of neurodevelopmental and mental health conditions (ND-CNVs). There is great individual variability in outcomes, but we lack insights into the contributing social factors, including family functioning. MethodsCaregivers of 598 children and young people (CYP) with a range of 16 ND-CNVs and 222 siblings without ND-CNVs (controls) completed questionnaires on overall family climate (cohesion and conflict) as well as caregiver-CYP relationship warmth and hostility and took part in a research diagnostic interview about CYPs psychiatric symptoms. CYPs intelligence quotient (IQ) was also measured. ResultsComparisons with published data from neurotypical families indicated that families affected by ND-CNVs are characterised by higher family cohesion and conflict as well as lower caregiver-CYP warmth and hostility. Symptoms of oppositional defiant disorder reduced more steeply in CYP with ND-CNVs compared to controls with increasing family cohesion (interaction effect: {beta} = -0.14, p = 4.65 x 10-{superscript 2}). In contrast, they rose more steeply with increasing family conflict (interaction effect: {beta} = 0.18, p = 1.05 x 10-{superscript 2}). Furthermore, symptoms of mood disorder increased more steeply with increased caregiver-CYP hostility in CYP with ND-CNVs (interaction effect: {beta} = 0.15, p = 4.55 x 10-{superscript 2}). ConclusionsRaising a CYP with a rare genetic condition is challenging. Timely access to interventions that support caregivers in fostering a positive family environment may reduce behavioural difficulties in CYP, with subsequent benefits for family functioning.

5
Biallelic CYB5A disruptions in 46,XY Disorder of Sex Development: Identification and Characterization of a Novel Deep Intronic Variant

Moradifard, S.; LE, T. N. U.; Ha, N. T.; Dung, V. C.; Thao, B. P.; Harley, V. R.

2026-05-12 genetic and genomic medicine 10.64898/2026.05.05.26352416 medRxiv
Top 0.1%
7.2%
Show abstract

BackgroundThe diagnostic yield for 46,XY disorders of sex development (DSD) remains limited. Whole-genome sequencing (WGS) improves detection of both coding and non-coding variants that may be missed by routine testing. Cytochrome b5, encoded by CYB5A, is an essential co-factor for CYP17A1-mediated 17,20-lyase activity. We report on WGS on a Vietnamese family with 46,XY DSD with two siblings presenting with female external genitalia. MethodsClinical assessment and hormone profiling were conducted. WGS was conducted on peripheral blood DNA, in two affected siblings followed by variant annotation and ACMG-based classification. A minigene RNA splicing assay in HEK293 cells was used to evaluate the functional impact of the CYB5A intronic variant. ResultsThe patients hormone profile showed low testosterone and estradiol. WGS identified compound-heterozygous CYB5A variants: a paternally inherited missense variant (p.Val34Glu, likely pathogenic) and a maternally inherited deep intronic deletion (c.129+862_129+863del) for which SpliceAI predicted aberrant splicing. Minigene assays confirmed that the intronic deletion creates cryptic splice sites, resulting in pseudoexon inclusion and a premature stop codon, consistent with nonsense-mediated decay. The intronic variant meets ACMG criteria for pathogenicity. ConclusionThis family expands the spectrum of CYB5A-related DSD and demonstrates that compound-heterozygous variants, including deep intronic defects, can lead to a disruption in 17,20-lyase activity. These findings highlight the importance of WGS and functional assays for identifying clinically relevant non-coding variants in DSD.

6
Differential causative effects of germline pathogenic variants in MUTYH and PALB2 in a patient with colorectal polyposis and breast cancer

Camacho Valenzuela, J.; Pelletier, D.; Polak, P.; Fu, L.; Hamel, N.; Domecq, C.; Ahmed, A.; Robles-Espinoza, C. D.; Foulkes, W. D.

2026-05-25 genetic and genomic medicine 10.64898/2026.05.15.26352890 medRxiv
Top 0.1%
7.1%
Show abstract

Purpose Patients carrying Germline Pathogenic Variants (GPVs) in multiple cancer susceptibility genes (CSGs) can be described within the context of Multi-locus Inherited Neoplasia Allele Syndrome (MINAS). The role of each GPV is typically interpreted based on clinical phenotypes. Here, we used tumor sequencing, particularly mutational signatures, to investigate the contribution of GPVs in MUTYH and PALB2 to colorectal polyposis and breast cancer in a single patient at a molecular level. Methods We analyzed tumor sequencing data, including mutational signatures and genomic scars, of a breast tumor and a colorectal polyp from a patient with biallelic GPVs in MUTYH and a heterozygous GPV in PALB2. Results The colorectal polyp showed a dominant contribution of MUTYH-associated Base Excision Repair deficiency (BERd) mutational signatures, with no evidence of Homologous Recombination Repair Deficiency (HRD). In contrast, the breast tumor showed both MUTYH-driven BERd and HRD-associated signatures, including SBS3, ID6 and an elevated HRD score, despite the absence of a detectable second hit in PALB2. These findings suggest a differential contribution from the CSGs, with MUTYH contributing to both lesions and PALB2 contributing specifically to the breast tumor. The observed pattern does not align with the additive or synergistic models described in MINAS. Conclusions Our study provides evidence that mutational signatures can elucidate the contribution of multiple CSGs to tumorigenesis within a single patient. These findings extend current interpretations of MINAS beyond additive or synergistic phenotypes, which may help to better understand tumor etiology, with potential clinical implications, including eligibility for targeted therapies.

7
The Genetic Landscape and Epidemiological Characteristics of Inherited Retinal Diseases in the Chinese Population

Zeng, B.; Cui, Z.; Zhou, S.; Dai, W.

2026-05-29 ophthalmology 10.64898/2026.05.27.26354224 medRxiv
Top 0.1%
4.8%
Show abstract

Background: Inherited Retinal Diseases (IRDs) are a group of genetically heterogeneous blinding conditions. Major global genomic reference databases are disproportionately enriched for individuals of European ancestry. This underrepresentation creates a significant bias that impedes the accuracy of genetic diagnosis in the Chinese population. This study aims to address this limitation by constructing a comprehensive genetic landscape of IRDs using large-scale deep-sequencing data from a large Chinese cohort. Methods: The study leveraged variant data primarily from 10,588 individuals in the China Metabolic Analytics Project (ChinaMAP) and cross-referenced findings against multiple national and international databases. We systematically curated variants within a targeted panel of 291 IRD-associated genes. Variant pathogenicity was assessed using a comprehensive pipeline integrating InterVar-automated classification based on 2015 American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) guidelines, ClinVar evidence (review status [&ge;] 1 star), and manual literature curation. We delineated the mutational spectrum, identified population-enriched pathogenic/likely pathogenic (P/LP) variants, and analyzed the distribution characteristics of IRD-associated highly-mutated genes. Furthermore, we calculated the carrier frequencies (CF) and genetic prevalence (GP) of autosomal recessive(AR)-IRD genes in the Chinese population. Results: The study revealed a highly concentrated genetic landscape for AR-IRDs in the Chinese population, with ABCA4 and USH2A emerging as the primary drivers of the genetic burden. This finding aligns with previous Chinese cohorts but contrasts with global databases, where genes such as the X-linked RPGR are more prevalent. In contrast, autosomal dominant (AD)-IRDs exhibited high locus heterogeneity, with pathogenic variants dispersed across numerous genes (e.g., COL2A1 and MFN2). We identified a series of P/LP variants that were either high-frequency or significantly enriched in the Chinese population, such as CNGB1 (p.P530R) and specific recurrent alleles in ABCA4 and CYP4V2. The estimated cumulative CF for AR-IRDs was 1 in 5.60, and the theoretical total GP was 1 in 2,624.67, based on the ChinaMAP data. Conclusion: By integrating the ChinaMAP dataset with diverse genomic resources, this study provides a genetic landscape of IRDs in the Chinese population. Our analysis shows a concentrated mutational spectrum in AR-IRDs, contrasting with the pronounced heterogeneity in AD-IRDs. These findings, including population-specific pathogenic variants and refined prevalence estimates, provide a resource for precision diagnostics, genetic counseling, expanded carrier screening (ECS), and public health policy development in China.

8
Detecting genomic regions enriched for reciprocal recombination in autism spectrum disorder

Mahoney, C. F.; Salter-Townshend, M.; Fitzpatrick, D. J.; Shields, D. C.

2026-05-27 genetics 10.64898/2026.05.26.727863 medRxiv
Top 0.1%
3.9%
Show abstract

Meiotic recombination is an important means of increasing genetic diversity by generating novel haplotypes in a population. Recombination separates linked loci extremely slowly in some regions, therefore genetic variants in high linkage disequilibrium may become co-adapted. Reciprocal recombination that separates co-adapted variants may generate a deleterious de novo haplotype that contributes to disease. We developed statistical methods to detect genomic regions of recombination excess in two different family-based study designs. We identified recombination in the Simons Simplex Collection in 273 simplex families with one child with autism spectrum disorder (ASD) and at least two unaffected children, in which recombinations can be mapped to the proband and contrasted with the recombination counts in unaffected siblings; and in 1,802 families with two children, where the number of recombinations identified can be contrasted with the expectation from a reference recombination map. Both strategies revealed a tail of low p-values for loci of interest that contrasted with the rest of the distribution. Permutation and bootstrap tests did not identify genome-wide primary findings in either cohort, but the most significant three-child cohort locus of recombination excess (between cadherin genes CDH4 and CDH26) replicated in the two-child cohort (p=0.01). While this replication strategy was not defined a priori, five of the most recombination enriched bins identified candidate ASD genes (p=0.02; WWOX, ADAMTS16, INSR, ADARB2, and HS6ST1). Since the six identified loci were not identified as regions of high de novo copy number variation in the study cohort and no CNVs were detected in any of the recombinant probands in the identified regions, they represent candidates for reciprocal recombinations generating unfavourable haplotypes for these genes. This study highlights a previously unidentified source of clinical genetic variability contributing to the molecular aetiology of ASD. AUTHOR SUMMARYAutism spectrum disorder (ASD) is a constellation of neurodevelopmental disabilities characterised by deficits in social communication and repetitive patterns of behaviour. While ASD is highly heritable, its genetic basis is complex and poorly understood. While some highly penetrant types of genetic variation have been identified, most people with ASD carry a large number of variants that each contribute a small amount to their overall phenotype. In addition to mutations in individual genes, changes in the configuration of genes along a chromosome may contribute to ASD. Here, we describe a method for identifying regions where such new configurations have occurred through recombination and attempt to find regions where such changes are more common in autistic children than in their non-autistic siblings. We explore recombination as a source of genetic variation contributing to autism, which has potential to inform clinicians in providing services to autistic people and their families.

9
Comprehensive analysis of de novo variants across 2,497 orofacial cleft trios reveals novel genetic drivers of disease

Kurtas, N. E.; Sanchis-Juan, A.; Shin, E.; Curtis, S. W.; Robinson, K. R.; Lee, A. S.; Alade, A. A.; Zhao, X.; Fu, J.; Diaz Perez, K. K.; Gowans, J. J. L.; Eshete, M. A.; Adeyemo, W. L.; Buxo, C. J.; Padilla, C. D.; Poletta, F. A.; Carreno Torres, A.; Wehby, G. L.; Hecht, J. T.; Moreno Uribe, L. M.; Mukhopadhyay, N.; Shaffer, J. R.; Weinberg, S. M.; Murray, J. C.; Beaty, T. H.; Butali, A.; Talkowski, M.; Marazita, M. L.; Leslie-Clarkson, E. J.; Brand, H.

2026-05-24 genetic and genomic medicine 10.64898/2026.05.21.26352934 medRxiv
Top 0.1%
3.6%
Show abstract

Background Orofacial clefts (OFCs) and other palate abnormalities (PAs) are among the most common birth defects worldwide and are characterized by the abnormal formation of the lip and/or palate. Genetic studies have traditionally classified OFC cases as either syndromic, involving OFCs alongside other congenital anomalies, or nonsyndromic, which represent the majority of cases and occur in isolation. Emerging genomic evidence indicates that genes traditionally associated with syndromic forms of OFC can also harbor variants contributing to isolated cases, challenging the notion of a strict dichotomy between these categories and supporting their integration for gene discovery. Methods In this study, we applied multiple analytic approaches to characterize the genetic architecture of OFC and PAs by integrating genomic data from 2,497 trios with an OFC (n=2080) and PA (n=417) affected proband. We compared these findings across OFC subtypes and syndromic status with those from 5,515 control trios to identify enriched biological pathways and mechanisms and to prioritize candidate genes using variant burden testing. Results We observed a significant enrichment of de novo protein-truncating and damaging missense variants in cases compared to controls (OR = 2.17, p = 1.21x10-32), with particularly strong signals in biologically relevant gene sets involving OFC-associated, constrained, Mendelian disorder, and mouse candidate genes. Variant burden testing identified 39 OFC risk genes at FDR [&le;] 0.05, which we then integrated with 593 established OFC genes to interrogate the functional underpinnings of OFC via network analysis. This analysis revealed 309 high-order interactor genes not previously associated with OFC. Notably, this OFC network clustered into ten distinct biological pathways, with nucleosome-associated genes showing significant enrichment among cases in our cohort (OR = 14.8, p = 8.1x10-4). In a final integrative step, we combined evidence across all analyses to nominate 231 candidate genes, 32 of which contained at least two deleterious de novo variants in our cohort. Conclusions These findings underscore the value of integrating diverse OFC and PA subtypes, syndromic status, and variant classes to refine the genetic architecture of these disorders, highlighting both phenotypic expansion of known disease genes and the emergence of novel gene-phenotype associations.

10
The D4Z4caster DNA methylation signature identifies individuals at epigenetic risk for developing facioscapulohumeral muscular dystrophy (FSHD)

Jones, T. I.; Eriksen, B. Z.; Farooqi, M. N.; Gould, T.; Jones, P. L.; King, O. D.

2026-05-29 genetics 10.64898/2026.05.26.727947 medRxiv
Top 0.2%
2.9%
Show abstract

BackgroundFacioscapulohumeral muscular dystrophy (FSHD) is caused by epigenetic dysregulation at the chromosome 4q35 D4Z4 repeat array under specific permissive genetic conditions. Due to the complexity, expense, and general inaccessibility of FSHD genetic testing, many individuals displaying characteristic muscle weakness are never genetically confirmed and at-risk relatives cannot get screened. We previously developed a targeted bisulfite sequencing (BSS) protocol using the Sanger method to determine DNA methylation levels at specific D4Z4 loci relevant to distinguishing forms of FSHD from non-FSHD that can be used with DNA isolated from saliva, thereby reducing cost and increasing accessibility compared to traditional D4Z4 deletion testing that uses DNA isolated from blood. MethodsHere, we adapt the D4Z4 BSS protocol to next-generation sequencing (NGS) to increase sequencing depth and further reduce cost, validate both sequencing technologies against several cohorts of genetically defined samples, and introduce the D4Z4caster software for computing DNA methylation signatures with diagnostic utility from raw sequencing data. ResultsBoth Sanger and NGS BSS methods using D4Z4caster were validated as providing high sensitivity and specificity, with geometric mean of sensitivity and specificity (G-mean) >95% and area-under-the ROC curve (AUC) of 0.99. The NGS method allows for higher throughput and increased read depth, while the Sanger method allows faster processing of individual samples. Importantly, the NGS method could identify FSHD1 cases that are likely mosaic and would otherwise be missed. ConclusionsD4Z4caster methylation signatures can accurately detect contracted FSHD1-permissive chromosome 4q35 alleles, hypomethylation of D4Z4 arrays indicative of FSHD2, and SNPs that are important for diagnostic use. This workflow is amenable to transitioning to clinical settings for an accurate, low-cost FSHD molecular diagnostic test that could be accessible worldwide. What is already known on this topicCurrently accepted genetic diagnostics for FSHD1 are complex and expensive and can mischaracterize certain complex genetic cases. These diagnostics all require high molecular weight genomic DNA typically freshly isolated from blood, highly specialized equipment, and additional testing for FSHD2, making FSHD diagnostics the most expensive among neuromuscular diseases and inaccessible to much of the world. However, the epigenetic status of the 4q35 and 10q26 D4Z4 repeat arrays, as determined by DNA methylation status using our bisulfite sequencing-based protocol, distinguishes genetically FSHD1, FSHD2, and non-FSHD samples. Additionally, since our protocol is PCR-based, it can utilize DNA isolated from multiple sources, including saliva and buccal swabs. What this study addsThis study validates the relevant DNA methylation signatures against several large cohorts of genetically-confirmed FSHD and non-FSHD samples and optimizes the DNA methylation data analysis for the greater accuracy required for diagnostic utility, including the exclusion of nonpathogenic chromosome 10q or 4A166 contractions. In addition, we introduce the D4Z4caster analysis software, which runs in a portable and scalable Docker container, and provides increased quantitative accuracy important for: 1) confirming likely clinical cases of FSHD that do not meet the currently accepted genetic definition of FSHD1 or FSHD2, 2) identifying FSHD1 somatic mosaicism, and 3) potential prognostic applications. How this study might affect research, practice or policyFSHD1 is genetically defined by a D4Z4 array at the 4q35 locus that is contracted to 1-10 repeat units. However, disease penetrance is influenced by repeat number, epigenetic modifications, and genetic background, causing a misalignment of current genetic diagnosis with clinical diagnosis. This study will improve the accuracy of epigenetic analysis for determining cases of genetic FSHD, help broaden the definition of genetic FSHD to more accurately correspond to clinical FSHD, and allow identification of those at risk for developing clinical FSHD in affected families and in large population studies now being performed and proposed. In addition, it will better inform how an individuals epigenetic status is interpreted for potential prognostic value. Overall, this methodology is: 1) significantly less expensive than current clinically-approved FSHD diagnostic technologies, 2) more accessible due to compatibility with DNA isolated from multiple sources including saliva, and 3) compatible with the current sequencing equipment and workflow for DNA isolation used in commercial clinical laboratories. Together, these advantages will help move the technology toward becoming an approved molecular diagnostic test for FSHD in the USA, Europe, and countries currently lacking clear access to testing.

11
Understanding the Patient Journey, Current Treatment Approaches and Emerging Therapeutic Opportunities in CLN2 Batten Disease

Whiteman, I. T.; Villa, K. L.; Spector, C. M.; Cha, J.-H. J.; Fenton Parker, A.; Ahrens-Nicklas, R.; Schulz, A.; Yohrling, G. J.

2026-06-02 neurology 10.64898/2026.05.31.26354557 medRxiv
Top 0.2%
1.9%
Show abstract

Background CLN2 disease, Neuronal Ceroid Lipofuscinosis (NCL) type 2, is a rare, genetic neurodegenerative condition predominantly affecting children. CLN2 disease is characterized by seizures, language and motor decline, vision loss, and premature death. Currently, the only regulatory-approved therapy is the enzyme replacement therapy (ERT) Cerliponase alfa, administered fortnightly via intracerebroventricular infusion as a lifelong treatment. While ERT has been shown to slow motor and language decline, it is not curative and does not fully address disease progression, including retinal degeneration. To better understand the lived experience of affected families, and perspectives on current and emerging treatments, we conducted a community survey of parents and caregivers of individuals with CLN2 disease. Methods A 25-question anonymous, voluntary survey was distributed through the BDSRA Foundation and international partner patient advocacy organisations via email and social media. Eligible participants included current and bereaved parents or primary caregivers of individuals with CLN2 disease, regardless of treatment history. The survey explored treatment experiences, unmet needs, and knowledge of and attitudes toward emerging therapeutic approaches, particularly gene-based therapies. Results Ninety-eight respondents from 19 countries completed the survey. Fifty-seven respondents reported current or prior use of ERT, with 94.7% (n=54/57) actively receiving treatment at the time of survey. ERT was perceived to provide greatest benefit for motor function and seizure control; however, respondents reported substantial treatment burden (mean burden score 4.8/7, n=66). Despite treatment availability, 94.9% of respondents (n=75/79) indicated a need for alternative therapeutic options and 94.8% (73/77) expressed interest in learning more about gene therapy. Overall, 72.4% (n=55/76) reported they were likely or very likely to consider participation in an investigational gene therapy trial. Key factors influencing decision-making included potential safety risks (57.9%, n=44/76), preclinical safety and efficacy evidence (54.0%, n=41/76), and whether ERT discontinuation would be required to participate (54.0%, n=44/76). Conclusion While ERT has altered the treatment landscape for CLN2 disease, this survey highlights the ongoing disease burden and treatment challenges experienced by families. Findings demonstrate strong community interest in next-generation therapies that may reduce treatment burden and provide more comprehensive disease modification, including effects on both central nervous system (CNS) and ocular manifestations.

12
Rare neurological and neurodevelopmental variants in ALS link to onset, survival and family history

O'Donoghue, C.; Kacar, E.; Gomes, T.; Costello, E.; Pender, N.; Peelo, C.; Ryan, M.; Heverin, M.; Byrne, S.; Bede, P.; Hardiman, O.; McLaughlin, R. L.; Byrne, R. P.

2026-06-10 genetic and genomic medicine 10.64898/2026.06.09.26354977 medRxiv
Top 0.2%
1.9%
Show abstract

Background: Neurological, neuropsychiatric, and neurodevelopmental disorders cluster in ALS families, sharing a common genetic architecture with ALS. Pathogenic variants in genes associated with other neurological, neurodevelopmental, or neuropsychiatric disorders may also co-occur in ALS and modify phenotype. We have sought to determine the prevalence and clinical pattern of likely-pathogenic/pathogenic (LP/P) non-ALS neurological, neurodevelopmental, and neuropsychiatric variants, alone and in combination with ALS-gene variants, in two large ALS cohorts. Methods: Whole-genome sequencing (WGS) of 469 Irish and 774 Answer ALS people with ALS (pwALS) was analysed for ClinVar LP/P variants associated with other neurological (n = 15541), neurodevelopmental (n = 9761), and neuropsychiatric (n = 321) phenotypes. Inheritance patterns for associated genes (autosomal recessive/autosomal dominant) along with the associated phenotype were validated using OMIM. Standardised clinical data included family history, site and age of onset, El Escorial category, survival, motor decline, and cognitive and behavioural assessments. Known ALS-gene variants and C9orf72 repeat expansion status were included for each cohort. Results: Non-ALS neurological variants were identified in 47/469 (10.0%) Irish and 69/774 (8.9%) Answer ALS participants, most frequently in hereditary spastic paraplegia-associated genes (3.2% Irish; 2.8% Answer ALS). Irish neurological variant carriers showed higher frequency of respiratory onset (10.6% vs 1.2%, Fisher's exact p = 0.002, {Phi} = 0.20) and fewer premorbid behavioural symptoms (0.92 +/- 0.56 vs 3.08 +/- 0.97, Cohen's d = -0.40). Neurodevelopmental variants occurred in 12/469 (2.6%) Irish and 20/774 (2.6%) Answer ALS participants. In the Irish cohort, neurodevelopmental variant carriers had significantly shorter survival in Cox proportional hazards model (log-rank p = 0.005), corresponding to a more than two-fold increased hazard of death (HR = 2.25, 95% CI 1.26-4.00), and had significantly increased familial burden of neuropsychiatric disorders among first- and second-degree relatives (negative binomial IRR for carriers = 2.41, 95% CI: 1.12-5.18, p = 0.025). Across combined cohorts, 18 individuals (Irish n = 8; Answer ALS n = 10) carried [&ge;]2 LP/P variants spanning ALS and non-ALS genes. Conclusion: Rare LP/P variants in genes associated with other neurological and neurodevelopmental disorders occur in up to 12% of pwALS across two independent cohorts. Carriers show distinct phenotypes, shorter survival, and characteristic family history patterns. These findings suggest that extended pleiotropic and oligogenic architectures may contribute to ALS heterogeneity.

13
A Common Pathogenic Founder Variant in Rwandan Breast Cancer Cases

Manirakiza, A. V.; Baichoo, S.; Uwineza, A.; Dukundane, D.; Rugengamanzi, E.; Mutamuliza, J.; Niragira, A.; Muvunyi, R.; Besada, J.; Nielsen, S.; Bucknor, B.; Koeller, D. R.; Andrews, C.; Mutesa, L.; Fadelu, T.; Rebbeck, T. R.

2026-05-29 genetics 10.64898/2026.05.26.727861 medRxiv
Top 0.3%
1.8%
Show abstract

Germline data from African populations remain sparse, limiting characterization of population-specific BRCA1/2 pathogenic variants. In a study of 175 Rwandan women with breast cancer, 7 unrelated carriers (4% of cases; 22% of pathogenic variant carriers) harbored the same BRCA1 frameshift variant, c.4065_4068del (p.Asn1355Lysfs*10), which is extremely rare in gnomAD yet recurrent in European, Asian, and Middle Eastern cohorts. Whole-exome sequencing and haplotype analysis of all 7 carriers revealed a shared ancestral block of approximately 581 kb surrounding the variant, and extended haplotype homozygosity and network analyses confirmed a common founder origin. Coalescent-based age estimation placed the founder event approximately 4,000--10,000 years ago. Comparison with 1000 Genomes Project data showed the founder haplotype is absent or exceedingly rare outside African and South Asian populations. These findings strongly suggest the c.4065_4068del variant as a pre-historical BRCA1 founder variant in Rwanda, with implications for targeted genetic testing, cascade screening, and cancer prevention in the region.

14
GWAS Meta-analysis Identifies Novel Associated Loci and Points to Causal Tissues in Central Serous Chorioretinopathy

Chen, L.; Kim, S. H.; Truong, B.; Rämö, J. T.; Gorman, B. R.; van Dijk, E. H. C.; Brinks, J.; Nikopensius, T.; Choi, S. H.; Kajanne, R.; Mehtonen, J.; Kaarniranta, K.; Sobrin, L.; Kurki, M.; Yzer, S.; VA Million Veteran Program, ; FinnGen, ; Wu, W.-C.; Turunen, J. A.; Segre, A. J.; Mercader, J. M.; Huerta, A.; Daly, M. J.; Palotie, A.; Ellinor, P. T.; Boon, C. J.; Iyengar, S. K.; Peachey, N. S.; Natarajan, P.; Rossin, E. J.

2026-05-22 ophthalmology 10.64898/2026.05.20.26353693 medRxiv
Top 0.3%
1.7%
Show abstract

Objective: To define CSC genetic architecture and identify implicated ocular tissues, cell types, genes, and circulating proteins. Data Sources: Genome-wide data were assembled from FinnGen, All of Us, Mass General Brigham Biobank, Million Veteran Program, and a Dutch chronic CSC cohort. Serum protein quantitative trait loci, human single-cell ocular atlases, and UK Biobank macular optical coherence tomography (OCT) imaging were used for downstream analyses. Study Selection: Five European-ancestry cohorts with genome-wide data and cohort-specific CSC case-control definitions were included, comprising 2,584 cases and 1,044,455 controls. Variants present in at least 2 cohorts were meta-analyzed. Data Extraction and Synthesis: Cohort-level GWASs were adjusted for age, age squared, sex, genotyping array or batch, and 10 genetic principal components, then combined using fixed-effects inverse-variance meta-analysis. Post-GWAS analyses included gene prioritization, colocalization, Mendelian randomization, single-cell disease-relevance scoring, and testing of a CSC genetic risk score in UK Biobank OCT images. Main Outcome(s) and Measure(s): Genome-wide significant CSC loci, effector genes and proteins, tissue and cell-type enrichment, and CSC-relevant OCT abnormalities. Results: Across 11,068,938 variants, 10 loci reached genome-wide significance (P < 5e-8), including 3 novel loci near TGFB1, LINC00551, and LOC105375630 and 7 replicated loci near CFH, CD46, NOTCH4, PREX1, PTPRB, GATA5, and TNFRSF10A. Integrative analyses prioritized 10 candidate effector genes. Colocalization and Mendelian randomization implicated circulating TNFRSF10A, TGFB1, and CASP10 levels. Single-cell analyses localized genetic risk to sclera (P = 2.0e-4) and vascular endothelial cells (P = 4.0e-4), with fibroblast enrichment. In UK Biobank, OCT abnormalities were more frequent in the top vs bottom 1% of CSC genetic risk (18 of 109 [16.5%] vs 8 of 134 [6.0%]; odds ratio, 4.05; 95% CI, 1.65-10.87; P = .002). Conclusions and Relevance: In this GWAS meta-analysis, CSC susceptibility localized predominantly to scleral and vascular biology rather than primary retinal pigment epithelial dysfunction. These findings support CSC as a sclerovascular disorder and nominate complement regulation, endothelial signaling, and extracellular matrix pathways for future study.

15
Contextualizing the Utility of Polygenic Risk Scores using Absolute Risk Models in Diverse Ancestry Populations

Chatterjee, N.; Martina, F.; Kachuri, L.; Natarajan, P.; Witte, J.; Huo, D.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354842 medRxiv
Top 0.3%
1.7%
Show abstract

Polygenic risk scores (PRSs) are emerging as powerful tools for quantifying inherited risk for common diseases and, in some cases, are approaching clinical implementation. A major concern for PRS implementation is their limited accuracy in non-European populations, particularly in those of African ancestry. However, past evaluations have focused on metrics such as relative risk or AUC, which do not capture background risk arising from contextual factors. We introduce a novel measure of variable importance, the conditional average derivative estimator (CADE), to evaluate PRS utility across diverse contexts and populations within absolute risk models that integrate PRSs with other relevant risk factors. We illustrate this framework by integrating PRSs for breast and prostate cancer within age-specific absolute risk models for incidence and mortality fit using individual-level data from the All of Us Research Program with inputs from the National Cancer Institute SEER cancer registry. Our projections show that although the PRSs are known to have the lowest discriminatory accuracy in African Americans (AA), there are contexts in which they provide greater utility, such as for the stratification of prostate cancer risk and mortality, where the CADE values for AA were 2- and 7-fold higher than for European Americans. These findings suggest that conclusions about the limited clinical utility of PRS in non-European populations may be premature and underscore the need to quantify PRS risk-stratification utility at the absolute-risk level, while accounting for disease onset, survival, and broader health and economic factors.

16
Evaluation of the Contribution of Natural Selection to Greater Cardiometabolic Disease Risk in South Asian Populations

Searby, D. J. C.; Hemani, G.; Chong, A.; Lawson, D. J.; Chaturvedi, N. J.; Davey Smith, G.

2026-05-22 genetic and genomic medicine 10.64898/2026.05.15.26353234 medRxiv
Top 0.3%
1.5%
Show abstract

A greater genetic susceptibility has been proposed as an explanation of the greater rates of cardiovascular and metabolic disease in South Asian relative to European populations. We first demonstrate that after accounting for technical artefacts the genetic effects for related traits are largely consistent between ancestral groups, which downplays the role of GxG or GxE interactions driving differential prevalence. If higher genetic susceptibility in South Asians is due to selective pressures acting through adiposity-related traits in the evolutionary past, signatures of selection should be evident at loci associated with cardiometabolic disease and other causally related traits (e.g. fat distribution). We tested for enrichment of several selection statistics (FST, XP-EHH and XP-nSL) at loci associated with a range of traits related to cardiometabolic disease, in comparison to a null distribution of linkage disequilibrium (LD) score and minor allele frequency (MAF) matched SNPs. Loci associated with a subset of these traits (Type 2 diabetes mellitus, trunk fat percentage, body fat percentage and trunk fat mass) exhibited enrichment for FST, consistent with a moderate adaptive explanation for their cross-population differentiation. In contrast, none of the studied traits were enriched for haplotype-based statistics, indicative that cross population genetic divergence is unlikely to have been driven by recent selective sweeps but has rather likely arisen from either ancient selection or recent polygenic selection acting on standing variation.

17
Inter-relationship of Retinal, Choroidal, and Scleral Thickness in High Myopia

Panigrahi, S.; Dhakal, R.; Vupparaboina, K. K.; Verkicharla, P. K.

2026-05-17 ophthalmology 10.64898/2026.05.13.26353083 medRxiv
Top 0.3%
1.4%
Show abstract

Purpose Considering that myopia is associated with thinning of the ocular coats, this study investigated the inter-relationship of retinal, choroidal and scleral thickness in foveal regions in Indian high myopes. Methods A total of 23 high myopes (spherical equivalent refraction [&le;]-6.00D) aged 16 to 35 years underwent posterior segment imaging with swept-source optical coherence tomography. The retinal, choroidal and scleral thickness was determined using semi-automated custom-designed software at sub-foveal regions. Axial length was determined using Lenstar LS 900 non-contact biometer. Results The mean plus-or-minus sign SD axial length was 30.17 plus-or-minus sign 2.23 mm, sub-foveal retinal thickness was 245 plus-or-minus sign 28 lower case Greek mum, sub-foveal choroidal thickness was 82 plus-or-minus sign 46 lower case Greek mum, and sub-foveal scleral thickness was 254 plus-or-minus sign 68 lower case Greek mum. The choroid was significantly thinner compared to the retina and sclera (p<0.001). With a 1 mm increase in axial length, there was no significant variation in sub-foveal retinal (increased by 0.86 lower case Greek mum) and scleral thickness (decreased by 4.31 lower case Greek mum, p[&ge;]0.05), but sub-foveal choroidal thickness decreased by 10.35 lower case Greek mum (p=0.02). For a 1D decrease in spherical equivalent refraction, the choroidal thickness reduced significantly (decreased by 5.88 lower case Greek mum, p<0.001), while there was no significant variation in retinal (decreased by 0.68 lower case Greek mum, p=0.55) and scleral thickness (increased by 0.13 mum, p=0.98). The association of the sub-foveal retinal, choroidal, and scleral thickness was weak and was not significant in high myopes (p[&ge;]0.10). Conclusions With increasing axial length and severity of myopia in high myopes, compared to scleral and retinal thickness, the choroidal thickness alone decreased significantly. Our findings indicate that the changes in the choroid do not necessarily reflect the changes in retinal and scleral thickness and highlight the importance of the choroid as a marker for axial elongation even in high myopes.

18
Changes in the profile of adults diagnosed as autistic since 2010: population based studies in England and Sweden

Sadik, A.; Lundberg, M.; Khandaker, G. M.; Pardinas, A. F.; Lee, B. K.; Madley-Dowd, P.; Magnusson, C.; Rai, D.

2026-05-28 epidemiology 10.64898/2026.05.20.26353486 medRxiv
Top 0.3%
1.3%
Show abstract

Objective: To understand if sociodemographic and neuropsychiatric characteristics of people diagnosed with autism in the United Kingdom (UK) and Sweden have changed since 2010. Design: Cross-context population-based cohort studies. Setting: UK primary care records from 2010-2023 and Swedish population-wide register linkages from 2010-2021 Participants: 24,537,039 individuals age 16 or over, registered with general practices in the UK, including 141,119 with an autism diagnosis. 9,096,874 people age 16 or over in the Swedish Total Population Register, including over 100,817 with an autism diagnosis. Main outcome measures: Annual age-standardised incidence and prevalence of adult autism diagnoses within different sociodemographic groups. Annual age-standardised proportion of adults with new autism diagnoses, lifetime autism diagnoses, and no autism diagnoses, with prior records of other neuropsychiatric conditions or medications. Results: Incident adult autism diagnoses were consistently higher in Sweden than the UK, however incidence increased rapidly in the UK after 2020. Incident diagnoses increased fastest for 16-25-year-olds and females in both nations, as well as people in White ethnic groups in the UK and people with Swedish-born parents in Sweden. For example, in the UK in 2023 the age-standardised incidence of autism diagnoses among 16-65 years olds was 11 diagnoses per 10,000 person-years (95%CI: 10.7, 11.3) in the White ethnic group and 2.2 diagnoses per 10,000 person-years (95%CI: 1.9, 2.5) in the South Asian ethnic group. Over time there has been a consistent decline in the proportion of autistic adults with a prior diagnosis of epilepsy, psychosis and intellectual disability and an increase in the proportion with a prior diagnosis of ADHD, anxiety, depression and several other mental illnesses. For example, in the UK between 2010 and 2023 the age-standardised proportions of newly diagnosed autistic adults with prior records of epilepsy decreased from 10% (95%CI: 7.6, 13) to 4% (95%CI: 3.6, 4.5), while the proportion with records of anxiety increased from 28.7% (95%CI: 24.4, 33.6) to 58.3% (95%CI: 56.6, 60.1). Mental health conditions were generally more common in females and the reduction over time in intellectual disability was greater in females than males. Conclusions: The socio-demographic and neuro-psychiatric characteristics of individuals diagnosed as autistic have changed dramatically since 2010, a phenomenon observed both in the UK and Sweden. The extent to which these changes indicate nuanced recognition of autism or broadening of diagnostic practice needs investigation.

19
Neovascular Glaucoma at a Tertiary Centre in Finland, 2008-2024: A Retrospective Cohort Study

Simons, G.; von Fersen, M.; Summanen, P.; Harju, M.

2026-06-02 ophthalmology 10.64898/2026.06.01.26354330 medRxiv
Top 0.3%
1.3%
Show abstract

Background/Aims: Neovascular glaucoma (NVG) is an aggressive secondary glaucoma with limited longitudinal data. This study reports the aetiologies, treatments, and longitudinal outcomes in NVG. Methods: Patients with NVG were identified through electronic medical record review. Inclusion required documented rubeosis of the iris and/or anterior chamber angle, intraocular pressure (IOP) [&ge;]25 mmHg, diagnosis during 2008-2024, and follow-up at Helsinki University Hospital. Baseline data and all follow-up visits were included. Results: Of 919 patients identified, 626 met inclusion criteria, with a median follow-up of 24 months. The estimated NVG incidence was 2.2/100,000/year. The most common aetiology was central retinal vein occlusion (CRVO; 45%), followed by diabetic retinopathy (DR; 14%), central retinal artery occlusion (CRAO; 11%), and ocular ischaemic syndrome (8%). Half of patients had hand motion vision or worse at baseline, with 18% at no light perception (NLP). At 5 years, 13% of patients had Snellen 6/60 vision or better. Visual outcomes differed by aetiology, with median time to NLP ranging from 1.6 (CRAO) to 9.1 (DR) years (log-rank p=0.002). Median baseline IOP was 40 mmHg, decreasing to 21 mmHg by 1 year. Ocular pain fell from 43% at baseline to 11% at last follow-up. Structural eye loss (e.g., enucleation or phthisis) occurred in 3% by 5 years. Conclusion: The estimated incidence was lower than previously reported elsewhere. Unlike other cohorts where DR predominates, CRVO was the most common aetiology, and visual prognosis was strongly aetiology-dependent. Glaucoma drainage device surgery reached 7.6% at 3 years, despite the severity and refractory nature of NVG.

20
Stratified evaluation of blood RNA sequencing in a rare disease cohort

Duzenli, T.; Durmus, S.; Kaya, H. E.; Sevilgen, F. E.; Kayhan, G.; Cakir, T.; Ergun, M. A.

2026-05-28 genetic and genomic medicine 10.64898/2026.05.27.26353804 medRxiv
Top 0.3%
1.3%
Show abstract

Background: RNA sequencing (RNA-seq) is increasingly recognized as a complementary tool to DNA-based sequencing for improving the diagnostic yield in Mendelian disorders. However, how the diagnostic performance of RNA-seq varies across molecularly and phenotypically distinct patient subgroups remains poorly defined. This study aimed to evaluate and compare the diagnostic utility of RNA-seq across three stratified groups of patients with non-diagnostic exome sequencing. Methods: We performed RNA-seq on whole blood samples from 90 patients with suspected Mendelian disease in whom clinical exome or whole-exome sequencing had failed to establish a molecular diagnosis. Patients were prospectively stratified into three groups of 30: (i) patients with a candidate variant of uncertain significance (VUS) with predicted splicing impact (Group 1), (ii) patients with a specific clinical pre-diagnosis but no identified pathogenic variant (Group 2), and (iii) patients without a specific pre-diagnosis or candidate variant (Group 3). Aberrant splicing, gene expression outliers, and allele-specific expression were analyzed using multiple bioinformatic tools and compared against a GTEx-derived control cohort. Results: RNA-seq contributed to a molecular diagnosis in 29 of 88 evaluable patients (32.9%). Diagnostic yield differed substantially across groups: 82.8% (24/29) in Group 1, 6.9% (2/29) in Group 2, and 10% (3/30) in Group 3. In Group 1, RNA-seq enabled reclassification of candidate VUS through direct demonstration of aberrant splicing events. In Group 2, RNA-seq identified a somatic mosaic ACTB variant missed by exome sequencing and reclassified a previously deprioritized APPL1 VUS. In Group 3, a deep intronic pseudoexon-activating variant in IGBP1 was identified in two siblings with severe microcephaly, providing evidence for a candidate X-linked microcephaly gene, and a pathogenic RNU4-2 variant was detected in a patient with ReNU syndrome, a non-protein-coding gene not captured by standard exome sequencing. Conclusions: RNA-seq has the highest diagnostic utility when applied to evaluate candidate splice variants identified by prior DNA testing but also provides independent diagnostic value in patients without candidate variants. The systematic comparison across stratified patient groups supports the integration of RNA-seq into clinical genomic workflows and highlights the need for standardized analytic frameworks.